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NOVELTY - The column-by-column grid computation of the edit distance 
is performed between search string and computation text . If all columns 
are not computed, the prefix of the text corresponding to the columns up to 
and including the column with minimum threshold is made as forbidden prefix 
so that next computation text is set as next to present computation text 
that does not share the forbidden prefix. 

USE - For searching edit distance string e.g. to locate the medical 
history of a patient among large set of histories, to detect duplicate 
benefit payments by government agencies, and to aggregate data about a 
customer with multiple accounts at a financial institution. 

ADVANTAGE - Uses column bounding and prefix column sharing to reduce 
computation, while increasing computation speed. 
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Alerting Abstract ...NOVELTY - The column-by-column grid computation of 
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text . If all columns are not computed, the prefix of the text 
corresponding to the columns... 

USE - For searching edit distance string e.g. to locate the medical 
history of a patient among large set of. . . 
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A process determines for a search string which, if any, of the strings 
in a text list have edit distance from the search string less 
than a threshold. The process uses dynamic programming on a grid with 
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...is at least the threshold, then the prefix corresponding to that and 
previous columns causes edit distance to be at least the threshold. 
So the computation for the present text is abandoned, and computations for 
any. . . 
Claims : 

...claimed is: <b>K/b>. A method for identifying among a list of texts 
those that have edit distance from a search string that is less than 
a threshold, said method comprising: beginning with a first text from a 
list of texts as a computation text ; providing a search string and a 
threshold value; performing a column-by-column grid computation of edit 
distance between the search string and the computation text , stopping 
early if a column minimum value is at least the threshold; if the 
edit distance is computed and is below the threshold, then reporting 
that the computation text is close to the search string in edit 
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NOVELTY - A vector having length equal to number of letters in alphabet 
for construction of given word to be corrected, is computed, and the 
replacement word letters are iterated by defining the number of letters in 
replacement/given words not appearing in given/replacement words. The 
replacement word is accepted, if its actual distance from the given word is 
less than maximal edit distance . 

DESCRIPTION - An INDEPENDENT CLAIM is also included for computer-readable 
medium storing codes for correcting spelling of word or phrase in document. 

USE - For correcting spelling of words or phrases in documents. 

ADVANTAGE - The spelling of word or phrase in a document, is corrected 
quickly and exactly. 

DESCRIPTION OF DRAWINGS - The figure shows the flowchart explaining the 
steps involved for correcting the spelling of word or phrase in document. 
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followed by a "slow method" which computes the exact edit distance 
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remaining candidate words . The combination results in a method that 
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NOVELTY - Two algorithms determine degree of matching between words in a 
suffix tree ST(T) of the text T and query Q, and between the sequences of 

words . The algorithms search the data structure with queries Q in the 
form of words , sequences of words or sequences of symbols or 
combinations, so that information R is retrieved on the basis of query with 
specified degree of matching between the words and word sequences . 

DESCRIPTION - The information retrieval takes place with a given or 
varying degree of matching between query Q and retrieved information R. A 
data structure stores text T, metric M which measures the degree of 
matching between query and retrieved information. A search algorithm 
executes full text search, based on keywords. The tree structure in the 
form of suffix tree ST(T) which stores the suffixes of words , word 
sequences and symbol sequences in a text T and the metric having 
combination of edit distance metric for matching between words or 



symbols in the text and the query, and between sequences S of words and 
symbols and the query sequence P. One of the edit metric has weighting cost 
functions for edit operations, for transforming sequence S to sequence P. 
An INDEPENDENT CLAIM is also included for text information retrieval 
method . 

USE - In search system for text information retrieval. Also used to 
search documents in world wide web (WWW) . 

ADVANTAGE - Fast and efficient search and retrieval of information in 
large volume of data is enabled. Can be applied to searching and retrieving 
information stored in the form of digitalized images and graphic symbols. 

DESCRIPTION OF DRAWINGS - The figure shows schematic structure of search 
engine with search system. 
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Alerting Abstract ...a suffix tree ST(T) of the text T and query Q, and 
between the sequences of words . The algorithms search the data 
structure with queries Q in the form of words , sequences of words or 
sequences of symbols or combinations, so that information R is retrieved 
on the basis of query with specified degree of matching between the words 
and word sequences . ...tree structure in the form of suffix tree ST(T) 
which stores the suffixes of words , word sequences and symbol 
sequences in a text T and the metric having combination of edit 
distance metric for matching between words or symbols in the text and the 
query, and between sequences S of words and symbols and the query 
sequence P. One of the edit metric has weighting cost... 
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matching between words or word sequences in a suffix tree 
representation of respectively the text T and a query Q. The algorithm 
searches the data. . . 

...retrieval generates a word-spaced sparse suffix tree for storing 
suffixes of words in a text T as word sequence information, and a 
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edit operations which transform a sequence S of words or symbols 
into the sequence P, and a search algorithm for determining the degree 
of matching between words or word sequences in a suffix tree 
representation of respectively the text T and a query Q . The 
algorithm searches the data structure with the query Q, retrieving 
information with specified match to... 

...retrieval generates a word-spaced sparse suffix tree for storing 
suffixes of words in a text T as word sequence information, and a 
word size-dependent edit distance metric for word sequences S, P 

and including word -weighted cost functions for edit operations , 
and determines matches between word sequences SR or retrieved 
information R and word sequences PQ of a query Q by calculating 
the edit distance for all matches. Use in an approximate search 
engine . 



...des fonctions de ponderation des mots dans des operations de mise au 
point de textes transformant une sequence S de mots ou de symboles en 
une sequence P; et un algorithme de recherche... 

...de concordance entre des mots ou sequences de mots dans une 
representation presentant respect ivement le texte T et la question Q. 
L ' algorithme recherche la structure de donnees en posant la question Q et 
recupere 1 ' information correspondant specif iquement .. . 
...mots d 1 information, recourant a un metrique de mesure du niveau de 
concordance entre les sequences de mots S et P, comportant des 
fonctions de cout ponderees en mots pour les operations de mise 
Claims : 

. . . und Sequenzen P hieraus umfasst und wobei die Information R Worter 
und/oder Symbole und Sequenzen hieraus aus dem Text T umfasst, wobei 
das Suchsystem eine Datenstruktur zum Speichern wenigstens eines Teils des 
Textes T und ein Mass M umfasst, welches den Grad der Ubereinst immung 
zwischen der Anfrage Q und der wiedergewonnenen Information... 



...fur einen ungefahren Grad an Ubereinstimmung zwischen Wortern und/oder 
Symbolen s;q in dem Text T bzw. einer Anfrage Q und ein Edit-Abstandsmass 
Dws(S,P) fur einen ungefahren... 

. . . Gewicht-Kostenf unktion fur Editieroperationen umfasst, welche Sequenzen 
von Wortern und/oder Symbolen s in dem Text T in die Sequenz P von 
Wortern und/oder Symbolen q in der Anfrage Q t ransf ormiert , wobei die 
Gewichtung. . . 

...symbols q and sequences P thereof, and retrieved information R 
comprising words and/or symbols and sequences thereof from the text T, 
wherein the search system comprises a data structure for storing at least 

. . .words and/or symbols s and sequences S thereof in the text T, that the 
metric M comprises a combination of an edit distance metric D(s,q) 
for an approximate degree of matching between words and/or symbols s ; q 
in respectively the text T and a query Q and an edit distance metric 
Dws(S,P) for an approximate degree of matchingbetween sequences S of 
words and/or symbols s in the text T and a query sequence P of words 
and/or symbols q in the query Q, the latter edit distance metric 
including weighting cost functions for edit operations which transform 
sequences of words and /or symbols s in the text T into the sequence P 
of words and/or symbols q in the query Q, . . . 

...value proportional to a change in the length of the sequence S upon a 
transformation or dependent on the size of the words and/or symbols 
s;q in sequences S/P to be matched, that the implemented search algorithms 
comprise a first algorithm for determining the degree of matching 
between words and/or symbols s;q in the suffix tree representation of 
respectively the text T and a query Q, and a second algorithm for 
determining the degree of matching between sequences S;P of words 
and/or symbols s;q in the suffix tree representation of respectively 
the text T and the query Q, said first and/or second algorithms searching 
the data structure with queries Q in the form of either words, 
symbols, sequences of words or sequences of symbols or combinations 
thereof , such that information R is retrieved on the basis of query Q 
with a specified degree of matching between the former and the latter, and 
that the search algorithms optionally also comprise a third algorithm 
for determining exact matching between words and/or symbols s/qin the 
suffix tree representation of respectively the text T and the query Q 
and/ or a fourth algorithm for determining exact matching between 
sequences S/P of words and/or symbols s;q in... 

...text T and the query Q, said third and/or fourth algorithms searching 
the data structure with queries Q in the form of either words, 
symbols , sequences of words, or sequences of symbols or combinations 
thereof, such that information R is... 

...Systeme de recherche pour la recuperation d ? informations , en 

particulier d' informations stockees sous forme de texte, dans lequel un 
texte T comprend des mots et/ou. . . 

. . . correspondance entre la demande Q et' les informations recuperees R, et 
dans lequel le systeme de recherche met en oeuvre des algorithmes 
de recherche pour executer une recherche, en particulier une recherche 
de texte seulement sur la base de mots clefs... 

...distance d'edition DWS(S,P) donnant un degre approximatif de 



correspondance entre des sequences S de mots et/ ou de symboles s du 
texte T et une sequence de demande P de mots et... 

. . . comprenant des fonctions de ponderation de cout pour des operations 

d' edition qui transforment des sequences de mots et /ou de symboles s du 

texte Ten des sequences P de mots et/ou de . . . 

...et de la demande Q, lesdits premier et/ou deuxieme algorithmes 
recherchant dans la structure de donnees des demandes Q ayant la forme 
de mots, de symboles, de sequences de mots ou . . . ces elements, de telle 
maniere que les informations R sont recuperees sur la base d' une demande 
Q avec un degre predetermine de correspondance entre les informations et 
la demande; etles algorithmes de recherche... 

. . .pour determiner une correspondance exacte entre des mots et/ou des 
symboles s,q dans la representation sous forme d'arbre de suffixes 
respectivement du texte T et de la demande Q et/ou un quatrieme 
algorithme pour determiner une correspondance. exacte entre des sequences S, 
P de mots et/ou de symboles s,q dans la representation sous forme 
d'arbre de suffixes du texte T et de la demande Q, les troisieme... 

...A search system for information retrieval, particularly information 
stored in form of text, wherein a text T comprises words and/or symbols s 
and sequences . . . 

...wherein the query Q comprises words and/or symbols q and sequences 
thereof P, and retrieved information R comprising words and/or symbols 
and sequences thereof from the text T, wherein the search system comprises 
a data structure... 

...form of a non-evenly spaced sparse suffix tree ST(T) for storing 
suffixes of words and/ or symbols s and sequences S thereof in the 
text T, that the metric M comprises a combination of an edit distance 
metric D(s,q) for an approximate degree of matching between words and/or 
symbols s;q in respectively the text T and a query Q and an edit 
distance metric Dws ( S , P) for an approximate degree of matching between 
sequences S of words and/or symbols s in lthe text T and a query 
sequence P of words and/ or symbols q in the query Q, the latter edit 
distance metric including weighting cost functions for edit operations 
which transform sequences of words and/ or symbols s in the text 
T into the sequence P of words and/or symbols q in the query. . . 
...with a value proportional to a change in the length of the sequence S 
upon a transformation or dependent on the size of the words and /or 
symbols s;q in sequences S;P to be matched, that the implemented 
search algorithms comprise a first algorithm for determining the degree of 
matching between words and/or symbols s;q in the suffix tree representation 
of respectively the text T and a query Q, and a second algorithm for 
determining the degree of matching between sequences S;P of words and/or 
symbols s ; q in . . . 

...text T and the query Q, said first and/or second algorithms searching 
the data structure with queries Q in the form of either words, 
symbols, sequences of words or sequences of symbols or combinations 
thereof, such that information R is retrieved on the basis of query Q 
with a specified degree of matching between the former and the latter, and 
that the search algorithms optionally also comprise a third algorithm for 
determining exact matching between words and/or symbols s;q in the 
suffix tree representation of respectively the text T and the query Q 
and/ or a fourth algorithm for determining exact matching between 
sequences S;P of words and/or symbols s;q in the suffix tree 



representation of respectively the text T and the query Q , said third 
and/or fourth algorithms searching .the data structure with queries 

in the form of either words, symbols , sequences of words, or 
sequences of symbols or combinations thereof, such that information R is 
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String matching method for matching between query and candidate strings - 
calculating lower bound estimate of string edit distance between two 
strings by equalising lengths of strings by adding padding elements to 
shorter one, sorting elements and summing substitution costs between 
corresp. elements 
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Alerting Abstract EP Al 

The matching method calculates a lower bound estimate of the string edit 
distance (46) between the strings by equalising the lengths of the two 
strings by adding padding elements to the shorter string. The query string 
and the candidate string (43) are sorted according to the value of their 
elements. The sum of the substitution costs of the elements -in corresp. 
positions in the two strings is calculated. The candidate best match is 
stored . 

The lower bound distance for each of the remaining candidate strings is 
calculated. If it is greater than the current one, then that string is 
discarded. The current best match string or the current candidate string is 
retained depending on which one has the lower string edit distance from 
the query string as the current best match candidate string. This string is 
then stored. 

USE - In string matching using lower bound estimate of string edit 
distance . 

ADVANTAGE - Provides improved method of string matching reducing amount 
of computation required to perform string matching. 



Title Terms/Index Terms/Additional Words: STRING; MATCH; METHOD; QUERY; 
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...calculating lower bound estimate of string edit distance between two 
strings by equalising lengths of strings by adding padding elements to 
shorter one . . . 

Alerting Abstract ...The matching method calculates a lower bound 
estimate of the string edit distance (46) between the strings by 
equalising the lengths of the two strings by adding padding... 

...or the current candidate string is retained depending on which one has 
the lower string edit distance from the query string as the current 
best match candidate string. This string is then... 

...USE - In string matching using lower bound estimate of string edit 
distance . 



Original Publication Data by Authority 



Original Abstracts: 

...a query string against a plurality of candidate strings replaces a 
highly computationally intensive string edit distance calculation with 
a less computationally intensive lower bound estimate. The lower bound 
estimate of the string edit distance between the two strings is 
calculated by equalising the lengths of the two strings by adding padding 
elements to . . . 

...a query string against a plurality of candidate strings replaces a 
highly computationally intensive string edit distance calculation with 
a less computationally intensive lower bound estimate . The lower bound 
estimate of the string edit distance between the two strings is 
calculated by equalising the lengths of the two strings by adding 
padding elements to the shorter one. The elements... 
Claims : 

...against a candidate string, the method comprising calculating a lower 
bound estimate of the string edit distance between the strings 
and characterised in that calculating the lower bound estimate 
comprises : </br> - equalising the lengths of the two strings by adding... 

...one or more candidate strings, wherein this matching is determined in 
accordance with a string edit distance , wherein the string edit 
distance is a minimum sum of costs of edit operations required to 
transform one string into another; characterised in that the step of 
matching the query string against the one or more candidate strings 
comprises the step of calculating the lower bound estimate of the string 



edit distance between the query string and at least one of the 
candidate strings, wherein the step of calculating the lower bound estimate 
of the string edit distance between the query string and a candidate 
string comprises : </br> - equalising the lengths of the two strings by 
adding padding elements to the shorter string (63);</br> - sorting the 
two strings according to the value of their internal elements 
(64);</br> - calculating the sum of the substitution costs of the 
elements in . . . 

...65), the sum of the substitution costs being the lower bound estimate of 
the string edit distance . 



. . . A method for calculating a lower bound estimate of string edit distance 
between query string and a candidate string, the method 
comprising : equalising lengths of the strings... 

...the query string and the candidate string according to their element 
values ; calculating a sum of substitution costs of the elements in 
corresponding positions in the sorted strings, the sum of the substitution 
costs being the lower bound estimate of string edit distance. > 
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Spelling correction method for computer documents by comparing edited words 
generated with words stored in dictionary and substituting misspelt with 
unique candidate word 
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Alerting Abstract WO Al 

NOVELTY - The minimum edit distance is calculated using a restricted 
set of edit operations which correct common errors comprising insertion, 
deletion, transposition and/or substitution after comparison with a 
dictionary of valid words. A unique candidate word for misspelled word is 
substituted after comparing edited word generated in the preceding step. 

USE - For computer documents. 

ADVANTAGE - It reduces the time required for correcting spelling error. 

Title Terms/Index Terms /Additional Words: SPELLING; CORRECT; METHOD; 

COMPUTER; DOCUMENT; COMPARE; EDIT; WORD; GENERATE; STORAGE; DICTIONARY; 
SUBSTITUTE; MISSPELT; UNIQUE; CANDIDATE 
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SPELLING CORRECTION METHOD USING IMPROVED MINIMUM EDIT DISTANCE 
ALGORITHM. . . 

...NOVELTY -The minimum edit distance is calculated using a 
restricted set of edit operations which correct common errors comprising 
insertion . . . 



Original Publication Data by Authority 



Original Abstracts : 

...method of spelling correction comprises the steps of: a) storing a 
dictionary of valid words, b ) for each input string to be checked 
comparing the input string to words in the stored dictionary to 
identify input strings not in the dictionary , c) for each input 
string not found in the preceding step, generating test words by. . . 

...operations which correct the most common errors comprising insertion, 
deletion, transposition and/or substitution, d) comparing the edited 
input string generated in the preceding step with words stored in th 

dictionary and e) generating a candidate word or candidate list of the 
words . . . 
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Computer implemented method for identifying patterns in transaction 
sequences - involves identifying which large sequences are not subsets of 
other large sequences and returning maximal large sequences to indicate 
recurring purchasing patterns over time 
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Alerting Abstract EP A2 

The method involves entering an item set into a large set of item sets 
when the number of items is present in the database exceeds a predefined 
minimum support value. A forward set of large sequences is defined and 
sequences are concatenated in accordance with a predetermined concatenation 
regime to generate a next set of candidate large sequences. 

Each sequence in the next set of candidate large sequences is compared to 
the sequences in a transformed set of sequences. A candidate large sequence 
is entered into a next forward set. The set of maximal large sequences is 
output for identifying particular transaction sequences over time. 

ADVANTAGE - Allows for quick mining of large databases which is easy to 
use and cost-effective. 

Title- Terms /Index Terms /Additional Words: COMPUTER; IMPLEMENT; METHOD; 
IDENTIFY; PATTERN; TRANSACTION; SEQUENCE; SUBSET; RETURN; MAXIMUM; 
INDICATE; RECURRENCE; PURCHASE; TIME 
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Original Publication Data by Authority 
Claims : 

...large sequences when the number of times the candidate large sequence is 
present in the transformed set of sequences is greater than the 
minimum support value; and</br> ( f ) outputting the set of maximal large 
sequences for identifying particular transaction sequences over time... 

...38) an itemset into a set of large itemsets when the number of times the 
itemset is present in the database exceeds a predefined value; 
<b>characterised by</b> : generating (40) a transformed set of sequences 
by discarding a transaction when the transaction does not include an 
itemset in. . . 

...does not include an itemset in the set of large itemsets; andgenerating 
(42) a set of large sequences which appear in the database a number 
of times exceeding the predefined value, including the ...large sequences 
and concatenating sequences in the forward set of large sequences to 
generate a next set of candidate large sequences; ( d ) comparing each 
sequence in the next set of candidate large sequences to the sequences in t 
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Molecular sequence aligning method for microbiological applications , 
involves computing gap effect in individual or prealigned multiple 
sequences, and symbol jumping effects in prealigned sequence 
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Alerting Abstract EP Al 

NOVELTY - The matching or coincidence between symbols of individual 
molecular sequence, and prealigned multiple molecular sequences are 
determined at evaluation position. The effect of introducing a gap as a new 
position in any one sequence, and the effect of symbol jumping from one 
prealigned sequence to the other are computed to determined the optimum 
individual alignment. 

USE - For aligning of sequences e.g. nucleotide and peptide sequences in 
microbiology . 

ADVANTAGE - The computation of the gap introducing effects, the alignment 
of the molecules are simplified. The time complexity associated with the 
calculation of jumping effects is reduced, by calculating the jump effects 
from one sequence to the other, which is the same for all sequences. Hence, 
the need for calculating the best jump individually for each sequence is 
eliminated. 

DESCRIPTION OF DRAWINGS - The figure shows the graphical representation 
of an edit matrix . 

Title Terms/Index Terms /Additional Words: MOLECULAR; SEQUENCE; ALIGN; 

METHOD; MICROBIOLOGICAL; APPLY; COMPUTATION; GAP; EFFECT; INDIVIDUAL; 
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Alerting Abstract ...DESCRIPTION OF DRAWINGS - The figure shows the 
graphical representation of an edit matrix . 

Original Publication Data by Authority 



Original Abstracts: 

...candidate sequence and a number of prealigned seed sequences; wherein 



the sequences are defined by a consecutive string of characters , each 

of which belongs to a group of symbols including a gap symbol; with 
the following step: -Finding an optimal alignment between... 

...the candidate sequence the best fit with a corresponding position of one 
of the seed sequences ; characterised in that for determining the 
optimal individual alignment at each position under evaluation, the 
following parameters are used: -the match between the symbol... 
Claims : 

...and a number of prealigned seed sequences; wherein the sequences are 
defined by a consecutive string of characters , each of which belongs 
to a group of symbols including a gap symbol; with the following step: 
finding an optimal alignment between... 

...the candidate sequence the best fit with a corresponding position of one 
of the seed sequences ; <b>characterised in that</b> for determining 
the optimal individual alignment at each position under evaluation , 

the following parameters are used: the match between the symbol of the 
candidate sequence and. . . 
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Method for calculating phonetic distance among different phonetic spelling 
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Alerting Abstract KR A 

NOVELTY - A phonetic distance calculation method is provided to perform a 
phonetic distance among different spellings on an adopted word so that it 
can enhance an efficiency of an information retrieval system. 

DESCRIPTION - The method comprises steps of a system manager defining 
consonant and vowel change patterns which can occur among the different 
phonetic spellings on an adopted word(Sl), the system generating the 
different phonetic spellings based on the defined consonant and vowel 
change patterns and granting a penalty on each different phonetic 
spelling (S2 ) , the . system calculating a minimum phonetic distance between 
the two different phonetic spellings by applying a minimum edition 
distance calculation method(S3), and the system determining the different 
phonetic spelling with the shortest phonetic distance ( S4 ) . The consonant 
and vowel change patterns is classified into a transposition of the 
consonants and vowels, an insertion/deletion of the consonants and vowels, 
and an extension/compression of the consonants and vowels. 

Title Terms/Index Terms /Additional Words: METHOD; CALCULATE; PHONETIC- 
DISTANCE; SPELLING; ADOPT; WORD 

Class Codes 

International Classification (Main) : G06F-017/20, G10L-015/10 
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Alerting Abstract ...calculating a minimum phonetic distance between the 
two different phonetic spellings by applying a minimum edition distance 
calculation method (S3), and the system determining the different phonetic 
spelling with the shortest phonetic... 
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Original Abstracts: 

A phonetic distance calculation method for similarity comparison between 
phonetic transcriptions of foreign words. A system manager defines . 
character element transformation patterns occurrable between phonetic 
transcriptions derived from the same foreign language. A system generates 
new phonetic transcriptions according to the defined character element 
transformation patterns and assigns a demerit mark to each of the 
generated phonetic transcriptions according to a phonetic distance. A 
minimum. . . 

. . .phonetic transcriptions and a given phonetic transcription is calculated 
on the basis of a minimum edit distance calculation method. Any one 
of the generated phonetic transcriptions with a smallest one of the 
calculated minimum phonetic distances... 
Claims : 

...1. A phonetic distance calculation method for similarity comparison 
between phonetic transcriptions of foreign words, comprising the steps 
of: a) defining, character element transformation patterns occurrable 
between phonetic transcriptions derived ' from the same foreign language, 
by classifying said patterns into three types, substitution of one 
character element with a different one, insertion or deletion of one 
character element, and expansion of one character... 

...into two or contraction of two consecutive character elements into one, 
wherein said expansion/contraction patterns are restricted to 
character element combinations as following table , classifying said 
three types into consonants and vowels, and then classifying said 
consonants into final and initial consonants, Patterns 
Character element combinations Final/initial + */ E-, -*/ -*/ 

-*/ vowel -*/ — , -*/ - Vowel/vowel + vowel -/ -, -/ -, -/ - , 

-/ -, -/ -, -/ - assigning a demerit mark to each of 

said patterns, and storing said patterns and said demerit marks into a 
database; b ) taking a phonetic transcription input, and generating new 
phonetic transcriptions for said input and assigning... 

...between each of said new phonetic transcriptions and said input on basis 
of a minimum edit distance calculation method, wherein using said 
demerit marks; and d) considering any one of said new 



Set Items Description 

51 87348 (WORD??? OR CHARACTER? OR TERM? ? OR TEXT? ? OR ALPHABET? - 

OR SPELLING? OR LETTER?) (3N) (STRING? OR SEQUEN? OR PATTERN? OR 
COMBINATION?) 

52 5678 S1(7N) (MONITOR? OR EXAMIN? OR DETECT? OR UNCOVER? OR REVEA- 

L?'OR ASSESS? OR EVALUAT? OR INSPECT?) 

53 13414 S1(7N) (DETERMIN? OR COMPAR? OR DISCERN? OR ASCERTAIN? OR A- 

NALY? OR IDENT? OR CHECK? OR VERIF? OR JUDG???) 

54 8226 S1(5N) (SET? ? OR GROUP? OR CLUSTER? OR BUNCH? OR COLLECTIO- 

N? OR AGGREGAT? OR SAMPL? OR DICTIONAR?) 

55 380681 (MAXIMUM? OR MAX OR TOP OR HIGH? OR GREAT? OR PEAK??? OR P- 

ENULTIMAT? OR FULL??? OR MOST?? OR LAST?? OR THRESHOLD? OR TH- 
RESHHOLD? OR BRINK) (5N) (NUMBER? OR AMOUNT? OR COUNT? OR SUM OR 
TOTAL? OR TALLY? ? OR QUANTIT? OR MANY OR KNOWN OR SIZE? ?) 

56 10341 S5(7N) (.EDIT??? OR LEVENSHTEIN? OR HAMMING? OR PARE? ? OR P- 

ARING OR INSERT? OR DELET? OR TRANSFORM? OR CHANG???) 

57 97302 (STOP??? OR STOPPING OR CEASE?? OR CEASING OR END??? OR HA- 

LT??? OR CESSAT?) (3N) (COMPUT? OR DETERMIN? OR TOTAL? OR TABUL- 
AT? OR CALCULAT? OR PROCESS? OR FIGUR? OR FORMULAT?) 



S8 


2755 


S2:S3 AND S4 


S9 


176 


S8 AND S5 


S10 


5 


S9 AND S6 


Sll 


28 


S9 AND (MATRIX? OR GRID? OR TABLE? OR MATRIC?) 


S12 


0 


Sll AND (EDIT??? OR LEVENSHTEIN?) 


S13 


98 


S8 AND (EDIT??? OR LEVENSHTEIN?) 


S14 


11 


S13 AND (MATRIX? OR GRID? OR TABLE? OR MATRIC?) 


S15 


11 


S14 NOT S10 
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